INTRODUCTION

This is a project that involves analyzing a police data-set from Dallas in 2016. Which is available an the source from https://www.kaggle.com/datasets/center-for-policing-equity/data-science-for-good.

The data was cleaned by changing some variable structures, examples SUBJECT_DATE to a date format, time was formatted to represent the hour of the day. This analysis aims to look at the selected crimes description such as mentally unstable, Marijuana, Drugs, Alcohol and unknown drugs. I also looked at the subject race that was mostly arrested by various officers race that made most arrest. Further analysis was also done on time series, showing most crimes per hour, and month.

ANALYSIS

## Rows: 2,383
## Columns: 48
## $ INCIDENT_DATE                                <chr> "9/3/16", "3/22/16", "5/2~
## $ INCIDENT_TIME                                <chr> "4:14:00 AM", "11:00:00 P~
## $ UOF_NUMBER                                   <chr> "37702", "33413", "34567"~
## $ OFFICER_ID                                   <chr> "10810", "7706", "11014",~
## $ OFFICER_GENDER                               <chr> "Male", "Male", "Male", "~
## $ OFFICER_RACE                                 <chr> "Black", "White", "Black"~
## $ OFFICER_HIRE_DATE                            <chr> "5/7/14", "1/8/99", "5/20~
## $ OFFICER_YEARS_ON_FORCE                       <chr> "2", "17", "1", "24", "7"~
## $ OFFICER_INJURY                               <chr> "No", "Yes", "No", "No", ~
## $ OFFICER_INJURY_TYPE                          <chr> "No injuries noted or vis~
## $ OFFICER_HOSPITALIZATION                      <chr> "No", "Yes", "No", "No", ~
## $ SUBJECT_ID                                   <chr> "46424", "44324", "45126"~
## $ SUBJECT_RACE                                 <chr> "Black", "Hispanic", "His~
## $ SUBJECT_GENDER                               <chr> "Female", "Male", "Male",~
## $ SUBJECT_INJURY                               <chr> "Yes", "No", "No", "Yes",~
## $ SUBJECT_INJURY_TYPE                          <chr> "Non-Visible Injury/Pain"~
## $ SUBJECT_WAS_ARRESTED                         <chr> "Yes", "Yes", "Yes", "Yes~
## $ SUBJECT_DESCRIPTION                          <chr> "Mentally unstable", "Men~
## $ SUBJECT_OFFENSE                              <chr> "APOWW", "APOWW", "APOWW"~
## $ REPORTING_AREA                               <chr> "2062", "1197", "4153", "~
## $ BEAT                                         <chr> "134", "237", "432", "641~
## $ SECTOR                                       <chr> "130", "230", "430", "640~
## $ DIVISION                                     <chr> "CENTRAL", "NORTHEAST", "~
## $ LOCATION_DISTRICT                            <chr> "D14", "D9", "D6", "D11",~
## $ STREET_NUMBER                                <chr> "211", "7647", "716", "56~
## $ STREET_NAME                                  <chr> "Ervay", "Ferguson", "bim~
## $ STREET_DIRECTION                             <chr> "N", "NULL", "NULL", "NUL~
## $ STREET_TYPE                                  <chr> "St.", "Rd.", "Ln.", "Frw~
## $ LOCATION_FULL_STREET_ADDRESS_OR_INTERSECTION <chr> "211 N ERVAY ST", "7647 F~
## $ LOCATION_CITY                                <chr> "Dallas", "Dallas", "Dall~
## $ LOCATION_STATE                               <chr> "TX", "TX", "TX", "TX", "~
## $ LOCATION_LATITUDE                            <chr> "32.782205", "32.798978",~
## $ LOCATION_LONGITUDE                           <chr> "-96.797461", "-96.717493~
## $ INCIDENT_REASON                              <chr> "Arrest", "Arrest", "Arre~
## $ REASON_FOR_FORCE                             <chr> "Arrest", "Arrest", "Arre~
## $ TYPE_OF_FORCE_USED1                          <chr> "Hand/Arm/Elbow Strike", ~
## $ TYPE_OF_FORCE_USED2                          <chr> "", "", "", "", "Take Dow~
## $ TYPE_OF_FORCE_USED3                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED4                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED5                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED6                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED7                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED8                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED9                          <chr> "", "", "", "", "", "", "~
## $ TYPE_OF_FORCE_USED10                         <chr> "", "", "", "", "", "", "~
## $ NUMBER_EC_CYCLES                             <chr> "NULL", "NULL", "NULL", "~
## $ FORCE_EFFECTIVE                              <chr> " Yes", " Yes", " Yes", "~
## $ adj_date                                     <date> 2016-09-03, 2016-03-22, ~

Plots

The plot below shows the numbers 0f subject_race that was arrested the most by specific race of each officers, which tuned to be black with more number of arrest by white officers, followed by hisapnic, the plot also showed that Asians has the of both police officers and crime rate.

##2

Furthermore, I filtered subject description of crimes to Marijuana, Drugs, Mentally unstable and, alcohol and unknown drugs, the visualization shows that most blacks that are Mentally unstable as described by these officers were mostly arrested, followed by the whites and hispanic. The hispanic and hispanic race was arrested more with drugs, blacks more arrest with alcohol.

Plot 3

This plot shows the probability of an arrest made by subject description by officers.

Time Series

This plots shows date per number of crimes, according to the plot most crime was committed in October, followed by between February and March and also between June and July.

Time Series per hour

This plots shows the rate of crimes and arrest carried out during each period of the day. As seen most crimes and arrest are done in the late hours of the night. From 16:00-22:00, and a bit higher during the early hours of the day.

Conclusion

From the analysis above, blacks are more associated with crime in Dallas, even more, white officers are usually kin on arresting blacks, probably due to their high association with crimes, this should be subjected to further analysis.